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ABSTRACT 

A study Iras made to derive an equation for predicting 
the "subjective" textual information contained in a. text of material 
written in the English language. Specifically, this investigation 
describes, by a mathematical equation, the relationship between the 
"subjective" information content of written textual material and the 
relative number of errors committed by a learner when asked to 
predict, letter by letter, the content of given textual material. The 
relationship shows that the subjective information of a given text 
for a specific learner is directly proportional to the number of 
wrongly-guessed signs made by that learner. This is expressed 
mathematically by: 1=3. IE; Where: I-Information in Bits; E=Number of 
wrongly guesfeed signs. The application of Shannon is guessing 
procedure (1951) in this study permits the measurement of the 
"subjective" information of a given text for a specific learner. The 
derived equation permits the measurement of information in teims of a 
value that is dependent not only on the inherent qualities of the 
subject mfatter, ^ut also on the internal state of the learner. 
(Author/WCM) 
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I - 3,1 E 

Where: I = Information in Bits 

E = Number of wrongly guessed signs 
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The application of Shannon's guessing procedure (1951) in this study permits the measurement of the 
subjective information of a given text for a specific learner. Unlike senseless texts, the subjective information of a 
meaningful text varies from learner to learner. Therefore, the derived equation permits the measurement of 
information in terms of a value that is dependent not only upon the inherent qualities of the subject matter, but also 
upon the internal 5 tate of the learner. 
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THE EHPIP.ICAL DERIVATION OF EQUATIONS FOR PREDICTING 
SUBJECTIVE TEXTUAL INFORMATION 



Introduction 

Objective 

This stgdy was designed to empirically derive an equation fdr 
predicting the subjective textual information contained in a text of 
material written In the English language. Speclflcallyy this Investl-* 
gation describes, by a mathematical equation, tht relationship between 
the subjective information content of written textual material and the 
relative number of errors committed by a learner when asked co predict, 
letter by letter, the content of the textual material* 



Rationale 

Parameters pertaining to Information processing by human beings 
Kave^ in the past, been determined by learning and memory experiments 
with nonsense syllables^ number sequences, etc« Because of the rela- 
tive simplicity of these types of experiments, the flow of Information 
and the subsequent information processing of these senseless texts 
could be measured and varied according to exact prescriptions. However 
in the real world we a re"" concerned with the processing of 'Wanlngful 
informat ions' ' not senseless texts* When the parameters are determined 
from senseless texts uncertainty is contained in the extrapolation to 
meaningful materials* Therefore, if we could determine the subjective 
information directly from meaningful material, we would most certainly 
reduce these uncertainties. As a result, one would be able to answer 
questions of the following type: ^ 

1« What is the amount of information contained in the laws 
of thermodynamics for a specific learner? 

2. How much information does a learner gain, in a certain 
period of instruction, under a given set of instructional 
conditions? 

3. How great Is the flow of information through a particular 
instruction-learning. channel? Is It too much? Is It too 
little? 

A. How much does the learner already know? 
5. How much does the learner tieed to know? 



Instruction Learning Process 

« 

Instruction and learning arc two vital aspects of the educatlona 
process that depend upon the manipulation of three activities: the 
Input of meaningful Information; the processing of this Information; 
and the output of meaningful Information. These three activities form 
the basis for the science of Information theory. 

The science of Information theory, since It deals with the fun- 
damental processes Involved In the Instruction-learning process, pro- 
vides the quantitative Instruments needed to describe this process. 

When we discuss the Instruction-learning process we are essen- 
tlally describing a communication system. Typically, a communication 
system can be described In terms of the following (see Figure 1): 

Information source: The mechanism that selects a desired 
message out of a set of possible messages. 

Transmitter: Encodes the message Into a signal. 

Channel: The medium through which the signal Is transmitted. 

Receiver; Accepts and decodes the transmitted signal Into 
the message. 

Destination: Interprets the message. 

■ Nolsre: Unwanted additions Imposed on the message. These 
additions can be either from an external source, Inter- 
nal source or comblnatloh^ of the two. Noise changes 
the Intended message. . 



Quantifying Information , * 

blnce Instruction Is concerned with the transference of Infor- 
mation and learning Is the act of (or the result of) the Information 
transfer. It Is obvious that we are not only describing a communication 
system, but also an Instruction-learning system. The Information source 
and transmitter being the teacher, textbook, computer, etc., while the 
receiver Is the learner. Because our Instruction-learning system Is 9n 
Information processing .system, we are now faced with several questions: 

1. What is Information? 

2. How can we measure Information? 

3. What are the characteristics of an efficient coding process? 
H, How could we measure the capacity of a channel? 
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Figure J ."Slmpl ifled VIcwof Communication System. , 



5. What are the general characterUtics of notse? 

6. What are the effects of notse on a message? 

7. How do we Increase or reduce the effects of noise? 

All of the questions are intriguing and deserve attention. How- 
ever, this Investigation addresses Itself only to the first two ques- 
tions. The other questions provide a basis 'or additional studies along 
with extending the results of this Investigation. 

Many definitions of Information are given In the literature 
(Ash, 1965; Edwards, 19M; Frank, 1962; Fuchs, 1968; Gullbaud, 1959; 
Khlnchln, 1957; Singh, 1966; Weltner, 1970; & others). But one appro- 
priate for this Investigation Is the definition given by Weaver (1969). 
He says, "Information Is a measure of one's freedom of choice when one'- 
selects a message." This conveniently ties the deffnltlon of Informa- 
tion and Its measurement directly together. 

To take the simplest case of two alternative messages (yes or 
no), we will say that the Information associated with this ensemble 
of two messages Is unity. The concept of Information, then, applies 
not to the Individual message^ but to the ensemble as a whole. ■ 

To be more precise, we- can define the amount of Infornftitlon by 



the logarithm of the number of 
In the ensemble. Thus, In our 
Information Is proportional to 



available choices (messages) contained 
case of two messages (yes or no) , the 
the logarithm of 2 to the base 2 such 
that: 10922 » 1 which Is unity. This unit of Information Is oailed . 
a "bit" from the two words bln«|ry digit. To generalize then, Vf^ts,^ 
have 8 alternative messages, anpong which we are equally free to' dhoose, 
we have log28 « 3 giving us an/ ensemble consisting of 3 bits of Infor- 
mation. / 

/ 

This situation Is completely adequate If we are only dealing 
with messages thac have an Equally probable chance of belnj selected. 
The probability of this occurring In language structure Is extremely 
small and Is zero for the /English language. In this study we are con- 
cerned with the Information content cf the English language which Is 
cogjposed of messages (letters, signs) arranged Into stochastic ensem- 
bles. These ensembles are arranyed according to certain probabilities 
In which the probabilities depend' on previous events. 

Because of the probabl | Isl Ic dependencies of the message (let- 
ters), It becomes exceedingly. more difficult to measure the amount of 
Information In the English language. Hov/ever, Shannon (1951) derived 
a "guessing proredure" for analyzing the syntactical Information content 
of meaningful texts. His guiisstng procedure enables one to calculate 
the subjective Information of a text for a specific learner. The 
learner attempts to guess the ensemble, letter by letter. After each 

1 
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guess he Is told whether his guess wa's correct or not. If Incorrect, 
he guesses again. If correct, he guesses the next letter or message 
of the ensemble until the entire ensemble has been derived. This pro^ 
cedure Is based on the assumption that at each guess the learner will 
name the letter with the highest sublectlve probability . Thus, we have 
a measure of the amount of Information a particular ensemble has for a 
particular leJirner. Shannon's equation forms the basis for this Inves- 
tigation and Is presented later. An example of the guessing procedure 
and subsequent analysis follows. 



Example of Guessing Procedure 

The guessing procedure enables one to empirically estimate the 
subjective Information of a text for a particular subject. The subjec- 
tive Information can be calculated according to several techniques. 
However, each .echnlque assumes that the subject guesses the. sign with 
the highest subjective probability. Consider the following hypothetical 
example: 

THE COW JUMPED 0 T • 

612 1 421 1 811111 1 3 

Under each letter of the text Is the number of guesses the subject made 
until the correct sign (letter) was guessed. From these numbers It Is 
relatively easy to calculate the Information content of the text by 
adding the Information value of the signs. The first letter (T) con- 
tains I0926 bits of Information, the second letter (H) log^l bits of 
information. Therefore, the first letter (T) contains 2.58 bits 
(logoS - 2.58) whereas the second letter (H) contains 0 bits (4og2l - 0) 
While the first letter dl.d contain Information for the subject (2.58 
bits), the second was perhaps expected with certainty and was, therefore 
devoid of any Information (0 bits). The total Information of the text 
can now be measured In bits as: ^ ^ 

H(text) - logjS + log2l + log22 + log2l + log2't + log22 

+ log2l + log2l + log28 + log2l + log2l + log2l 

+ log2l + 10921 + log2l + log23 

« log28 + log26 + log2'» + log23 + 2 log22 + 10 log^l 

- 3.00 + 2.58 +.2.00 + 1.58 + 2.00 + 0 

» 11.16 BITS 
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Dividing this by the length of the "-cxt N - 16 we arrive at .698 Bits/ 
Sign for .the information content of our sample text. 

Although this method I? relatively simple^ It does not provide 
the best estimate of the Information content. Shannon (1931 ) derive^d 
equation (1) which according to Frank (1962) Is not only a better mea** 
sure but also Is closer to the actual figure than other calculations, 
would indicate. 

H(tcxt^ f 2 Nj. • (r • ld(r) - (r-1) • Id(r-I)) (1) 

Where: N » amount of Information measured in bits 

« absolute frequency with which the number r occurs ' 
In the guessing sequence of N signs 

Id " logarithm to the base 2 (1og2r) 

f 

Applying Shannon's equation to our example^ we obtain 18.248 bits as 
the information of the text segment or ].kOS bits/sign which is a 
better estimate than our previous figure of .698 bits/sign. 

Procedures and Results 

This study consists of five procedural phases of operation. 
They are: : ~ 

Phase I: The design and development of computer programs 
to present and process the experimental data. 

Phase If: The selection and presentation of textual Infor- 
mation to selected subjects and the collection 
of their responses. 

Phase III: Calculation of subjective Information content 
contained In the textual material for selected 
subjects. 

Phase IV:* Derivation of equations to fit experimental data* 

Phase V: Selection of ''best-fitting'^ equation, and deletion 
of outl ler point. 
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PHASE! '^'^miajvnf 

Computer Program Development 

Two computer programs w^re written specifically for this Inves 
tlgatlon. Two other programs were used In the synthesis of the data. 
•|he programs, written in BASIC, were designed to operate in a time- 
shared environment. . ' . 



Program CYBERif 

Program CYBER4 was designed as a general purpose program to pre- 
sent selected textual passages to the subjects. CYBER4 can present 
textual information by letters, words, phrases, sentences, paragraphs, 
or any combination thereof. The content presentation can be predeter- 
mined by the Investigator or the program can be adapted to function 
under learner control. In addition, it can also be adapted to display 
Information at any intellectual level. 

The program initially outputs an extended phrase or set of 
phrases. The length of the phrase is pre-programmed by the investiga- 
tor and, for the most part, depends on the complexity of the material 
and the relative sophistication of the subject with regard to the 
academic content. 

After the Initial phrase is printed by the computer terminal, 
the subject is required to guess, letter by letter, the subsequent 
message. If the subject is incorrect, his response is placed In a 
'/response file;'' a zero is recorded in a *'score flie;'* and the subject 
is required to guess again. This process continues until his response 
is correct. 

If the subject's guess is corl'ect, his response is placed in 
the ''response file;" a one is recorded in the "score file;" his cor- 
rect answer Is acknowledged by the computer; and the next letter in 
the guessing sequence is presented. When the subject has responded 
to. the entire text, he is thanked for his effort and the time of com- 
pletion is recorded by the computer. 

) 

Program CYBER2 

Program CYBER2 was developed to calculate the amount of infor- 
mation for each subject, using equation (1) derived by Shannon (1951). 
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As can be seen In the sample output listed In Table 1, CYBER2 
outputs the percentage of Incorrect responses (X s) per subject and the. 
amount of Information per subject (Y^s) contained In the text for that 
particular subject. The measurement of Information Is In units of 
bits/sign, the sign being the message-unit letter. Each pnlr of numbers 
represent the X and Y coordinates used In fitting an equation to the 
data. In addition to calculating the X and Y values, CYBER2 also order* 
the results from the lowest to highest on the percentage of Incorrect 
responses (X^s 

TABLE 1 

X AND Y COORDINATES FOR HYPOTHETICAL SAMPLE 



Percent Incorrect (X^s) 



X( 1) - 22.2222 

X( 2) - 25.0000 

X( 3) - 26.0000 

X( 4) « 60.0000 



Bits/Sign (Ygs) 



Y( 1) « .9148 

Y( 2) - .5629 

Y( 3) « 1.3686 

Yfi*) - 1.8500 



Progr'am EQUFIT ' «. 

Program EQUFIT Is a commercially developed program available on 
the HG-255 time-sharing computer system at Arizona State University. 
EQUFIT applies curve fitting techniques to determine which of six general 
types of curves best fits the supplied data. The six general types of 
curves range from a linear function to a hyperbolic function. 

• # 

cniiriT ^ ^"'^ ^ coordinates generated by Program CYBER2 are used by 
EQUFIT to determine the equations that have the ''best fit" for the sup- 
plied data. EQUFIT outputs the following Information for each equation 
specified by the user: , 

1. Index of determination for each equation 

2. Coefficients for each equation 

3* X and Y values as supplied by user 

4. Calculated values of Y for each specified equation 

,5. The percentage difference between thc/actual Y value and 
the calculated Y value for each specified equation. 
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Program PTAFIT. 

PTAFIT Is a, modified and Improved version of EQ.UFIT developed by 
Plan-Test Associates of Phoenix, Arizona.* PTAFIT greatly extends the 
capabilities of EQUFIT. In addition to the Information supplied by 
EQUFIT, PTAFIT supplies the following: 

I. The level of significance at which the equation f\ts the 
supplied data. 

« . 2. The value of each coefficient, the standard deviation 
associated with that coefficient and confidence limits 
for tfle coefflcleht. The confidence limits are chosen 
by t\9e U5«r and can be at the 90^, 95|, and/or the 99^ 
level of significance. . ■♦• 

3. Confidence bands about each (X, Y) coordinate. The user 
mav^ specify the 90%, 95% and/or 33% prediction limits. 

k. Evaluation of how significant Is the apparent differences 
between equations. 

• '* . J 

PHASE II 



S electton and Presentatton of Textual Information 
J 

• 

The textual Information for this Investigation was selected 
from two sireas. The first area was concerned with academic facts In 
the areas of Information science* All participants had either prior 
experience In the academic field or were enrolled In an Information 
science course. The second area was concerned with current events. 
The most topical of current events » at the time of the Investigation, 
was the Watergate affair. It was being presented dally on television 
and In the news media/ It was thought that all subjects would be 
knowledgeable In this area; however, this was not the case. A greater 
percentage of Incorrect responses was made on the Watergate textual 
material than on the Information science textual material. 



Experimental Subjects 

A total of 239 subjects were available from the population of 
graduate students In the College of Education. An analysis o^ the 



*Plan-Test Associates, Financial Center, Suite 669, 3^^3 North 
Central Avenue, Pho6nIx, Arizona 850I2« 



subjects* academic background Indicated that all participants were 
functioning within the normal range of Intel 1 tgence. All subjects 
used In this Investigation were enrolled In College of Education 
graduate classes In an uncontrolled manner typical of college enroll* 
ment. It Is assumed that all subjects used In this investigation are 
representative. In term of language skills^ of the population of 
college graduates whose primary language is English. 

Out of the 239 subjects available to participate In this inves* 
tigatlon 118 were selected. These 118 subjects were divided Into 
three groups based upon their academic, background. 

Croup 1 : 25 subjects 

A test group to validate the computer programs and 
selection of the textual Information. The data 
derived from these subjects was not used In the 
final determination of the equation. 

Group 2: 59 subjects 

An experimental group of subjects who were exposed 
to the two types of textual materials. Group 2 
subjects were of yarled academic background and 
therefore considered to be less reliable In terms 
of their usefulness as a homogeneous group. The 
textual material presented to this group consisted 
of both academic materials (Information Science) 
and current material (Watergate affair). The data [ 
generated by Group 2 was suspect because of numer-, 
ous computer failures throughout the experimental 
period and was therefore rejected. 

Group 3: 3^ subjects 

An experimental group of subjects who were equiva- 
lent In terms of academic background In the field 
of Information science. All of the Group 3 sub- 
jects were concurrently enrolled In an Information 
science course at the time of the study. The 
- textual material presented to this group was 
directly concerned with previously learned mater- 
ial and material they were about to study In their 
class. This group provided the most reliable data 
for the final determination of the derived equation. 
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' PHASE III 

Calculation of Subjective Information 

After all subjects had completed the guessing i^quences^ Program 

CYBER2 processed the ''score file/' calculated the percentage of Incor*- 

rect responses and computed the subject I ve^ ^textual Information for 
each subject according to equation. (1): 

H(te?ct) » Z N • <r • ld(r) - (r-1) • ld.(r-l)) (1) 

Table 2 presents the results derived from test Group 3 as calcu- 
lated 5y Program CYBER?.. This table shows the percentage of Incorrect 
responses and the corresponding Information content for each of the 
subjects. The tabte Indicates that, In general, as the percentage of 
incorrect responses Increases the amount of Information Increases. 
This tends to confirm the theoretical predictions. ' 



PHASE IV 



Determination of Equations 

After all subjects had completed the guessing sequences, program 
CYBER2 processed the "Score File," calculated the percentage of Incor- 
rect responses and Qomputed the subjective textual Information for each 
subject according to equation (1). This provided the necessary Infor- 
mation for EQUFIT to process. 

Program EQUFIT provides a least squares curve fit for the data 
generated In Phase M t by program CYBER2. The six curve types and 
results are displayed in Table 3. The index of determination indicates 
that (equations 6, 3 and 1, respectively, provide a fit that is "quite 
good." 

Examination of Table 3 would tend to indicate that equation (6) 
is the best fit. Although this equation shows the highest Index of 
Determination, the other two equation^ are so close that It is difficult 
to judge which is the best fitting equation using only the Index >of 
Determination* 

Ope would prefer the linear function because of Its ease of 
usage, but only if It proved to be "as good a fit'* as the other, func- 
tions. The eqUatibn derived by Weltner (196?) for the German, language 
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. Table 2 
X AND y COORDINATES FOR GROUP 3 



Percent Incorrect Btts/SIgn 







■ 7 6^2^ 


T \ 




B 99^1 
* • 1 


x( 


2) 


« 10.0000 


T \ 


2^ ■ 




x( 

A \ 




- 10.0000 


T V 


J9 ■ 




x( 




■ 12.5000 


Y^ 

1 \ 


k) 1 


i ^9qq 


x( 

A \ 


5) 


« 1^.0000 

1 ^ • WW 


Y^ 


5) 1 
P/ ■ 


B UqUo 


A \ 


6) 


■ 1^.0000 

1 ^ • WW 


Y^ 
1 \ 


0/ ■ 


■ • PP^'O 


x( 

A \ 


7) 


1 ^ • ^ W"l w 


Y^ 
1 \ 


7V - 

99 




x( 

A \ 


8) 


IV* 


Y^ 
1 \ 


0/ ■ 


B ^077 


x( 


9) 


■17.5000 


Yf 
1 \ 






x( 

A \ 


10) 


■ 17.5000 

1 / • vv w 


Y^ 
1 \ 


10) ■ 


B UloL 


. x( 


11) 


■ 20.0000 

V • WW 


y( 
1 \ 


in a 


B i;iin7 


x( 

A \ 


12) 


■ 20 0000 

fcW • WW 


Y^ 


IC) ■ 




x( 

A \ 


13) 


B 20 0000 

V • WW 


y/ 






x( 


1A) 

IT/ 


- 20.0000 

fcV* WW 


Y^ 

1 \ 




B q286 


x( 

A \ 


15) 

' ^9 


« 20.0000 

V • WW 


Y^ 






* x( 


16) 


» 20.0000 

V • WW 


Y^ 
1 \ 


16) - 




x( 


17) 


•21.A286 


Y( 


17) ■ 


' .5364 


x( 


18) 


- 22.5000 


Y( 


18) - 


- .6406 


x( 


•rs) 


- 27.2727 


Xi 


i9r- 


> .6918 


x( 


20) 


- 27.5000 f 

- 27. $000 i 


Y( 


20) - 


' .9640 ^ 


x( 


21) 


Y( 


21) - 


• 1.0123 ^ 


'X( 


22) 


« 27.5000 


Y( 


22) - 


» .9593 


x( 


23) 


- 27.5000.'^ . 


Y( 


23) ■ 


' .8112 


x( 


Ik) 


? 32.5000 


•* Y( 


24) - 


- .9116 


x( 


25) 


- 35.0000 


Y( 


25) ■ 


• 1.2194 


x( 


26) 


- 35.0000 


Y( 


26) - 


• 1.2905 


x( 


27) 


-35.0000 


Y( 


27) • 


• 1.2210. 


x( 


28) 


- 37.5000 


Y( 


28) - 


• .7500 


x( 


29) 


- 37.5000 


Y( 


29) ■ 


• 1.2191 ' 


x( 


30) 


- 38.A6I5 


Y( 


30) - 


' 1.1554 


x( 


31) 


- 40.0000 


Y( 


31) ■ 


' 1.2366 


x( 


32) 


» 42.5000 


Y( 


32) - 


• 1.3555 


x( 


33) 


- 45.4545 


Y( 


33) • 


' 1.2501 


x( 


3A) 


- 56.25:00 


Y( 

r 


34.) - 


' 1.5010 



# 



18 



BEST copy AWUU8U 



TABLE 3 
LEAST SQUARES CURVE FIT 
EQUFIT PROGRAM 



Curve Type 



K Y-A+(B*X) 

2. Y-A*EXP(B*X) 

3. Y-A*(X+B) 
i|. Y-A+(B/X)« 

5. Y-1/(A+B*X) 

6. Y-X/(A+B*X) 



Index of 
Determination 



.892389 
.851611 
.922853 
.693833 
.7IIOA7 
.930395 



-1.A9193E-02 
.229436 

2.i»l865E-02 

1.35166 

3.48035 
36.2543 



B 



3.O249IE-02 
4J9008E-02 
1.05825 ' 
•12.2042 
■ 6.92674E-02 
• 5.O96O8E-02 
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is of a linear nature (see Figure 2). This would lead us to suspect 
. that the English language equation might .also exhibit similar linear 
properties. Therefore, It was decided, at this point, to further ana 
1yze the Group 3 data. This is done In Phase V. . 



. PHASE V 



Selection of "Be&t-FI tting" Equation 

Phase V, In part, utilized the program PTAFIT to further analyze 
the equations to determine which would be the most appropriate to 
describe the empirical data. 

Table 4 -shows the 1east*curves fit generated by program PTAFIT. 
All equations .are shown to be significant at the 33% level of confl' 
dence. ' 



TABLE k 



PTAFIT LEAST SQUARES CURVE FIT 



Curve Type 



IndeXs of 
Determination 



SIgnl f Icance 
90^ SS%' 33% 



1 Y-A+(B*X) 

2 Y-A*EXP(B*X) 

3 Y-A*(X+B) 
k Y-A+(B/X) 

5 Y-1/(A+B*X) 

6 Y-X/(A+B*xjL 



.8924 
.8517 
.92^9 
.691? 

.7ni 

.9304 



Number of Observations: 34 



-1.494E-2 3.025E-2 * * ^ 

.2294 .0419 * * . * 

2.418E-2 1.058 * ft A 

1.353 -12.2 * * * 

3.48 - 6.927E-2 * * * 

36.26 - 5.097E'2 * * * 



Equations 6, 3 and 1 again exhibit the highest Index of Deter- 
mination and therefore are prime candidates for further examination. 
Tables S and 6 show the standard deviation and confidence limits for 
the coefficients of equations 6 and 1. 
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TABLE 5 

EQUATION TYPE 6 
Y-X/(A + B*X) 



Coefficient 


A/alue 


Standard 


95? 








Deviation 


Confidence 


Limits 


A 


36.26 


• 1.75 


32.68 


39.83 


B 


- 5«.09E-2 


9.52E-2 


- .2i» 


.14 



^1^ 







TABLE 6 


• 






EQUATION TYPE \ 




« 




Y-A+(B*X) 


« 


Coefficient 


Value 


Standard 
Deviation* 


35% 

Confidence Limits 


A • 
B 


-1 .A9E-2 
3.02E-2 


5.20E-2 
1.85E-3 


- .12 ^ 9.11E-2 
2.64E-2 3.'»0E-2 
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Tab's 7 shows the PTAFIT output comparing the fit of all pairs 

of equations. It shows that equation 6 does not fit significantly 

better .than equation I. Therefore, since equation I represents a 

straight line It will be given special consideration In vlew;of Its 
convenience of Interpretation. 



TABLE 7 



COMPAjllSON OF INDICES OF DETERMINATION 



y5*. CQMFIDGNCe 



CQMF I DfeNCe 



t. 

1 

£ 
5 
4 



CURV»£ \\\2. 
1 £ 5 4 5 6 

♦ ♦ ♦ ♦ 

♦ ♦ ♦ ♦ 

♦ ♦ ♦ ♦ 

♦ ♦♦♦♦♦ 

♦ ♦ ♦ 

♦ ♦ ♦ 



•5 



CURVE NO. 
1 £ 3 4 5' 6 

♦ ♦ ♦ > ^ ^ 

♦ ♦ ♦ ♦ ♦ # 



r RiifERISKS IMDICfiTE PfilRS GF CURVES RRE'THE 'SflME' , 
•;Nar m HRVG SIGNIFICflNfLV DIFFERENT INDtCE' 

HE TERM I MAT I ON. fir THE SffiTEDCaNFIDEhCG LEVELS:'. 



GF 



Tables 8 and 9 show further output from the PTAFIT protgrem. The 
int with the coordinates of 37.5 and ^0.75 appeared suspiciously dev- 
iant and an outlier test run on the residuals confirms this suspicion. 
The outlier test was run using a Plan-Test Associates program based on ' 
a method reported by Chazal (1967). The graphical and tabular output 
from this progrd.ii are shown In Tables 10 and 11, respectively. 

In view of this, PTAFIT was rerun omitting the outlier. The 
resulting outputs are shown in Tables 11 through 17. We now find that 
equation 3 fits better than equation 6 (see Table 12), but still neither' 
is significantly bette( than equation 1 (see Table 15). 
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TABLE 8 

BEST COPY mmi 

EQUATION TYPE 1 
Y-A+(B*X) 



. 

X 


Y 


V-CRLC 




95 PREDICT I GN 


Linri 


•i IMC 


H 














.22-M 


.2177 


4 . 763 


-4 . 4Tf-".E- 


~2 


.4302 


10 • 


. 2^.64 


. 2375 


-7.352 






. 5479 


ICi. 


. ^i}97 


. 2375 


.7509 • 


£. 71i-:E- 




.5479 


1£.5 


. 3255 


. 3632 


-1 0.37 


1 Ii4:-: 




.6215 


15 


. 4-?42 


.4333 


12.63 






. 6955 


15 


. 5328 . 


,4333 


21.42 






. 6955 


15.35 


. 4233 


. 4494 


-5.694 


• 1 ■■ ■■ 




. 7059 


15. 


.3i:i77 


.4503 


-31.67 


r=i"::"; 

• A ■m>'*»»' 




.• 7063 


17.5 


.4321 


.5144 


-6.234 






. 7693 


1 7. 5 


.4724 


.5144 • 


-3. 17 




I 


. 7693 


20 


. 5t.27 


.5901 


-4.636 ■ 






. 3445 


£0 


.5693 


.5901 


-3.4 53 


. 3356 


0 


. 3445 


£0 


. 54 O?-^ 


.5901 


-3.365 


.3556 




.3445 


£0 


. .t'.Oc' 


.5901 


2. 024 


.3556 




.3445 


£0 


• . 52':.4 


.5901 


-10.79 


. 3356 




.3445 


£0 


. 52St. 


.5901 


. -10.42 


.3556 




. 3445 


£1.4 5 


. 53^4 


. 6333 


-15.3 


. 3793 




. 3374 


££.5 


. t.4 f)>l 


. 6657 


-3. 769 


.41 19 




.9195 


£7. £7 


.6913 


.31 '• 


-14.59 


.5564 




1 . 064 


£7.5 


. 9593 


. 3 1 69 


17.43 


.5633 




1.071 


27.5 


.31 12 


.3169 


-.7032 


. 5633 . 




1.C71 


(■7. 5 


. 9 '£.4 


.3169 


13. 


. 5633 




1 . 071 


£7. y 


1. 012 


.3169'' 


23.33 


. 5635 




1 . 071 


32. 5 


.9116 


. 9632 


-5.346 


.7134 




1 . 2£3 


35 


1.291 


1 . 0-14 


23.63 


.7373 




1.3 


35 


1.221 


1 . 044 


16.97 


.7373 




1 . 3 


_ 55. ._ 


...1.219 


1 . 044 


16.73 


. 7373 




1 . 3 


/37.5 


. 75 


1.119 


t 


. 362 




1 . 577 


'^T. 5 


1.219 


1 . 1 19 


8. 392 


. 362 




1 . 577 


33.41:. 


1 . 1 55 


1 . 1 49 


. 5657 


.39 04 


t 


1.407 


4 0 • 


1 . 237 


1 . 195 


3.507 


. 9 553 




) .454 


42.5 


1 . 35b 


1.271 


6.711 • 


1 . 009 




1.5 52 


45.45 


1 


1 . 36 


-3. 036 


1 . 096 




1 . 624 


5t..25 


1.501 


1 . 637 


-11. 01 


1 . 4 OS 




.1 .965 
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TABLE 9 
•aUATION TYPE 6 
Y-X/(A+B*X) 







Y-CflLC 




95 ••; PREniCTIDH 


LIMITS: 


\ IHC 


H 












. ££31 


. £145 


6.347 




.£461 


1 l!l 


. £»:.64 


. £793 


"4 . 776 


• c.'4r.'c' 


.331 1 


1 U 


. £397 


. £793 


3. 553 


.£4££ // 


.3311 


1 £ . 5 


. Jc!55 


. 35 1 


-7. £5£ 


. £955 


. 4 3£1 


1 .« 


. 4 ^411: 


. 4££7 


16.9 5 


.3457 


. 5438 


4 C 

1 Z.* 


. 5? £3 


.4££7 


£6. 06 


.3457 


.541:8 


1 cr -.c- 

1 *::• 


. 4 cl z- Z' 




-£. 065 


. 35£5 


. 5bi;i4 . 


1 I'C. 




. 4 5 ?6 


-£9. 04 


. 35 5 


.5618 . 


1 r . ^' 


. 4S£1 


.4949 


-£. 531 


. 5 9 '*' 


.6681.-- 


1 1 . 


. 4 7£4 


.'4949 


-4. 54£ 


. -"19 


. 668 1 ■ 




. 5t.£7 


.5-676 


3663 


.4]:?? 


. 807£ 


£0 


. 5695 


. 5676 


. 3346 


. .4::?? 


. S07£ 


cU 


. 541:17 


. 5676 


-4. 74£. 


. 4*;: 77 


. 8i:i7£ 


'Z* 1*1 


• 60£ 


. 5tr' 1 t« 


6. 057 


. 4:377 


. 8i;i7£ 


£0 


. 5£64 


.5676 


-7. £61 


.4377 


. 8i:i7£ 




. 5£36 


.5676 


-6.374 


.4 377 


. 8i:i7£ , 


£1.45 


.5364 


. 6095 


-11.39 


. 4t.££ 


. 8846 


k- k- * 


. 64 06 


.'64 09 


-4.4S3E-£ * 


. 48 


. 864 1 


C* 1 ' • il' i " 


.6913 


• I C' c c 


-11. 55 


. 554':. 


1 . 3£6 


£7.5 


. 9593 


. 739 


£ 1 . 53 


; 558 


1 . 346 


£7.5 


.311£ 


. 739 


£. 309 


. 558 


1 . 546 • 


£7.5 


. 964 


. 739 


c'll* • 1 r 


.558' 


1 . :?46 


£7.5 


1.01 £ 


. 739 


i£'3 . cl>:- 


. 558 


1 . 346 


5£.5 


.9116 


' . 9 594 


-d.956 


. t.c'84 


1 . 858 


35 


l.£91 


1 . 015 




, 6e 1 1 


£. .'87 


35 


l.££l 


1 . 015 




.6^.1 1 


£ .18 7 


'55 


l.£19 


1.015 




.•E.Al 1 


£. 187 


57.5 


. 75 


1 . 09£ 


-!:1.31 


. 3 


£.583 


37.5 


l.£19 


1 . 09£ 


1 1 . ^4 


. 68 £3 


£.58 3 


33.4 k 


1. 155 


1 . i£l 


£.9 IB? 


. 7 0 38 • 


c* . 1 5 1' 


4i:i 


1 « d z't 


1. 169 




.78tl-l 


3. 068 


4£.5 


1 . :-':<6 


1 .£47 


8. r-f.i 


. 1 J 


3.681 


45.45 


1 

1 « u 


1 . 3 59 




-?0 cr 
. 1 CT. .» 


4 . 64 .-: . 


56. £5 


1.51:11 


i . 635 


- 1 CI . ? 1 




16.85 



) 
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Vable 10 

OUfLlER TEST USING CHAZAL'S METHOD 



SYMBOL riEFJNITIDri: 

♦ = \']Efi\\ OF :nMPLE liHTR* 

: s= i^PfiWB MEPH DF riHTR •-.Ei-iCLUDIflG DUTLIEP;. • 
- = LDl.itP □UTLIEP LIMIT 
•f = UPPEP GUTLIEP LIMIT 



ft 

ri 

p 

L 

E 

I 
D 



0 . 0+ 



5 . 0+ 



1 0 . 0+ 



c'0 . 0+ 



c5 . IJ+ 



: 0 . 0+ 



-.16 



0 . 04 



i:i.£4 



■ 0.44 



0 . 64 



♦ 
♦ 



♦ 



- ♦ 

-.16- 



.•♦^ • • . .+ , 
0 . 04 



+ 

+ 
+ 
+ 

■ + 

+ 
+ 

♦ + 

♦ + 

♦ + 

+ 
+ 

+ 

0 . £4 



■0.44 



. . .+ 

0'.»:.4 



INCFEMEriT DF HDPirnr^fHL ::.i:RLE: c'.UOE-Oc' 
IMCPEMEra DF VEPTIChL IC^hLE: 1.00 



r<D. UF P□Ir^T.:. PLDTIEU DH VEPTICftL .XhLE: .•:4 
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BfSr copy AVAILABLE 

* 

TABLE 11 

OUTLIER TEST USING CHAZAL"S METHOD 



MERN • » • 


□LITLIER 


LIMITS 


Ci.OlO 


-.£37 


0.£6£ 


0.0i>£ 


-.£37 


• 0.£6£ 


-.0£1 


-.£37 


0.£6£ 


- . 038 


-.£37 


0.£6£ 


0 . 094 


-.£37 


0.£6£ 


0 . 055 


- .£37 


0 .£6£ 


- . 0£6 


-.£37 


0.£6£ 


-.143 


-.£37 


0.£6£ 


- . 04£ 


-.£37 


. 0.£6£ 


- . 03£ 


-.£37 


0.£6£ 


-.0£0 


-.£37 


0.£6£ 


- . 049 


-.£37 


0.£6£ 


0.01£ 


-.£37 


0.£6£ 


- . 064 , 


-.£37 


0.£6£ 


- . 06 1 


-.£37 


0.£6£ 


- . 0£7 


- .£37 


ft.£6£ 

*a*^ a ^» 




a r 


*J a 


. ■JL.--I 


a 1 




-.118 


. -.£37 


0.£6£ 


-.006 


-.£37 • 


0.£6£ 


0.147, 


-.£37 


0.£6£ 


0.195 


-.£37 


0.£6£ 


0.14£ / 


-.£37 


0.£6£ 


-.009 


-.£37 


0.£6£ 


0.177 


-.£37 


0.£6£ 


0.175 


-.£37 


0.£6£ 


0.£47 


-.£37 


0.£6£ 


0.100 


-.£37 


0.£6£ 


-.369 


-.£37 


0.£6£ 


0.006 


-.£37 


0.£6£ 


0 . 04£ 


-.£37 


0.£6£ 


0 . 085 


-.£37 


0.£6£ 


-.110 


-.£37 


0.£6£ 


-.186 


-.£37 


0.£6£ 



GRRMD MEAN (EXCLUDING DUTLIERS::' = 0.013 
BETIJEEN SD •:: EXCLUDING OUTLIERS) = 0.10£60 
WITHIN SD •:: EXCLUDING DUTLIERS.:' = 0.00000 



♦ = EXCLUDED DN PASS ND. 1 
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TABLE 12 

LEAST SQUARES CURVE FIT WITH OUTLIER REMOVED 



curve t'.F'f: ■ 


I HUE; : OF 


1=1 




I: 






liETEPNIt<HTIDM 






















4 




c' 'v=h*e::p* 






4 . 




♦ 




V-Kl*' :;ti; ' 




c'.c'44E-^ 


1 . 




♦ 




4 ■i*=H+ ' t : ; ' 


. r 1 1 f : 


1 . 37^:. . 


-Ic- 




♦ 




5 Y=l • '. H+l:*: : ■ 


.7171 




"~ r • 




♦ 




<£■ y~: '. - '.fi+i-*'. : • 


. 9 •:>.-^4 


::>:..7i 






4 





f^Uf'lttF: DF DEIEF-VhTIDH:.: 
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TABU .3 '"^«"^^^^/Maif 
EQUATION TYPE 3 . 
. Y«A''>(XtB) 
LEAST SQUARES CURVE FIT WITH OUTLIER REMOVED 



CQEFFICIENT VmLUE tTF-iHIiftFTi + - 

nfvifniDh cdhfihemce limiT^ 

I f^: £.c'mE--c- 1. 1ft- 1.64cE-c: 3.C't.7E-c- . 

■ b: l.OSt. 4..S47E-C: ,3S7£ l.iy.- 
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BEST COPY AVAILABLE 



TABLE \h * 
EQUATION TYPE 1 
Y»A+(B*X) 

LEAST SQUARES CURVE FIT WITH OUTLIER REMOVED 



CDEFFICIEriT 



r'RLUE 



?:TRHriRPri 
IiEVlRTIDM 



R : -3 . 08 1 E - £ 4 . 44£:E - c' 

B: 3.13SE-C: . 1 .t.O::-:E-:-: 

ESTIMRTED PDFULRT iLh " iTRMIiRpn HEVIRTIDM 
RBOUT REGRESSION LltlE = .1041 



95 

cdmfihemce limits 



-.lcl4 

£ .so jE-£ 



5. 

3.459E-C: 



/I 
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BEST mmmiE 



TABLE 15 

COMPARISON OF INDICES OF DETERMINATJON 
WITH OUTLIER REMOVED 



95-i CONFIHEIHCE 

CURVE MO. 
.1 £ 3 4 5 6 

3 ♦ ♦ ♦ ♦ 
6 ♦ ♦ ♦ ♦ 
,1 ♦ ♦ ♦ ♦ 

5 ♦ ♦ ♦ 

4 ♦ ♦ ♦ 



CDMFIDENCE 

. CURVE MO. 
1 £ :5 4 5 6 
3 ♦ ♦ ♦ ♦ 
6 ♦ ♦ ♦ 
1 ♦ ♦ ♦ ♦ 

5 ♦ ♦ ♦ 
'4 ♦ ♦ ♦ 



♦ RSTERISKS INDICRTE PPIRS OF CURVES RRE THE ' SRME^'. 
<MOT SHOWN TO HRVE S IGNIFICRNTLY DIFFERENT INDICES OF 
" HETERMINRTION RT THE STRTED CONFIDENCE LEVELS). 
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TABLE 16 
EQUATION TYPE 1 
Y«A+(BAX) 
SUMMARY TABLE WITH OUTLIER REMOVED 




V 




Y-CRLC 


ruF 


95 PPEIiICTIOH 


LIMIT: 


*. I Mi: 


H 












.cc-:?l 




:?.564 


-1 .302£r2 


.433£ 


10 


.:-:>->:a 




-5.663 


. i':i6 1 1 


.5 0 57 


10 


.2-y:*7 




c- 

l_ . J'-* 1 


.0611 


.5037 


1£.5 


. 5£.55 




-•?.757 


.1411 


.•6303 


15 


.4'H4c 




1£-.5S 


.££•0:3 


.657£ 


15 


.5.::c-:5 


. .439 


£1 .37 


.££03 


.657£ 


15. 55 


.4£'5c: 


.45 


-5.:513 


.£319 




15.3:5 


.3077 


.45U'r* 


-31 .76^ 


. £3£9 


• ttS'? 


17.5 


.48^-1 


.5173 


-6.305 


.3>00£ 


.7344 


17.5 


.47c'4 


.5173 


-3.679 


. .300£ 


.7344 




.5»-:..i-7 


.595»f. 


-5.5^3 


.3793 


.3 119 




.56ro 


.595k 


-4 .331 


.3793 


.3119 




.^•^07 


.595»^. 


7 . 


. 3: 7 9 3: 


.31 19 


cO 




.595*:. 


1 . 076 


.3793 


-.31 19 


£0 


.5.5»:".4 




-11.6c' 


.3793 


.31 19 


clft 


. 5c-:::6 


.59^^: 


- 1 1 . £,-5 


.3793 


.3119 


21 .4? 


.5:>.4 


.k4m4 . / 


- 1 6 . c-4 


.4344 


.3563 


CC.5 


.•:-4 Ut- 


.t.739 


-4 .941 


.4531 


.3397 




. 1 :5 




-15.97 


.6076 


1 . n?.9 


£7.5 


.V59 5 


::i:i5 


15.51 ■ 


.614:3 


1 . 0A6 




.yi \2 


.3305 




.6143 


1 . 046 


c-7.5 


.'?t-A 


.3 05 


1 6 . 03 


.6143 ■ 


1 . 046 


c-7.5 


1 . i J 1 £■ 


.33 05 


£•1 .:36 


.6143 


I . 046 


•-•i::: • J 




.9371 


-7.643 


.770£ 


1 .c'04 




1 . ^-H 1 


1 . iV.5 


21 .13 


.:E:475 


1 . C Z' 


•-•er 


1 ..Icl 


1 . 065 


14.61 


.3475 


1 . C O 


•-.er 
■J* _' 


1 .c:l9 


1 . UkS 


14.4£ 


.3475 


1 • c'o:I 


cr 

!• 1 • 


1 


1 . 144 


6.534' 


.9 £4 5 


1 .363 


:5!5.4t. 


1 . 1 55 


1 .174 


■ . -1 .599 ■ 


.9539 


1 .394 


4 0 




1 . c'c'c 


■. 1 .22v 


1 . 0 ij 1 


1 .443 


4 £.5 


1 .356 


1 .3 


4 .£34 


1 . 073 


1 ^.Z- 1' 

I . -^'i — • 


45.45 


1 • C 


1 .3?:- 


- 1 0 . c-5 


1 . 167 


1 .61:5 


56. £5 


1 .^.ul 


1 .731 


-13. £3 ' 


1 .493 


1 .969 



( 
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TABLE 17 
EQUATION TYPE 3 
Y=A*(XfB) 
SUMMARY TABLE WITH OUTLIER REMOVED 



'-CRLC 



DIF 



PREIilCTICri LIMIT:: 



IMC 


H 














. clcl'i 1 


1:15:5 


1 0 .Sb 




.154 


. .c-74:5 




,£i£.if.M 








.c:066 


.3€-c3 


1 0 


. clV'? 7 


• c 7 ■ 


5. 




.£066 


.5i£'C.-3 


12.^ 


.31-55 


• ::4:i:t. 






.£64:? 


.4591 


15 


.4?4c- 




16.il:5 




. -J'C •-•0 


. .557:5 


15 


.53c-:-:; 








. C •-• c* 


.557:? 


15.35 


.4c-:-::E: 


.4.::^.:=: 


-■£'.74:5 




.33c'l 


.571:5 


15.3:5 


.3077 • 


.43^.7 


-C.-9.54 




. -Z* •-• c' *z* 


c • 


17.5 


.4:E:c-] 




-4.051 ■ 




.3:535 


.65:5c' 


17.5 


,47£-4 




-5.?:51 




.3:535 


.k5:5c- 


c:u 


.5t.£'7 


• .5o 0'? 


-3. lc':5 




.443:5 


.7602 


20 






-1 .'P-i'ii:. 




.443:5 


.7i£.0£' 


£ 0 


.5407 


• TioO? 


-t..'?15 




.443:5 


.7t.0c 


£•(• 


.60c: 


. j!Ei 0'? 


3 . 6iv 




.445:5 


.7t-0c 




.5 c.- 6 4 


• 5 15 0'? 


.377 




.443:5 , 


.7t.0c- 




.5c-:E:t.. 




_;5 ^ ':j"^9 




.4438^ 


.7t.0c- 


c-1 .43 


.53t.4 


• 1 


-14 .33 




.47:55 


.£:r?£- 


-t rr 


.i:-4 Ok- 




-c 




.5046 


.:5t-37 


C 1 ' • i" 


. 6-? 1 :? 


.5r?4 


-14 .-f-5 




.t.c'14 


1 . 0I-.5 


£7.5 


.95-? :: 




It .:::•:• 




.tc7 


1 . 075 


£•7.5 


.:511c- 




-1 .1:51 




.t.c'7 


1 . 075 


£•7.5- 


.964 




17.43 




.6c-7 


1 . 1:175 




1 .01 c' 








. i:-iE.-7 


1 . 075 


3c-. 5 


.?1 It- 


.9:54£: 




.7505 


1 .£91 


C* •-' 


1 .c-Hl 


1 . 0b7 


I'l .0 3 




.:51c-t. 


1 .4 


::-i5 


1 .c-£l 


1 . Oi:.7 


14.47 




.:51c-6 


1 .4 


55 


1 .c-J-^ 


1 . Ot-7. 


14.l1':E: 




.:51£k 


1 .4 


37.5 


1 .£r? 


1.15 


6 . Oco 




.:574:5 


1 .51 1 


3:5 .4 1. 


1 .155 


1 .i:=J£ 


. -£.£•'1.9 






1 .554 


4 0 


1 • C 1' 


1 • ii •-• 


.31 0£' 




.'?37£ 


1 .t.c'3 


4£.'.5 


1 ..::56 


1 . ::17 


£-.?53 




.•?•?•?? 


• 1 .735 


45.45 


1 •^•5 


1 .417 


-11 .77 




1 . 074 


1 .S69 




1 .?01 


1 .7&b 


-15. '^5 




1 .345 
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B8I con MWlABLt 

The Engl ish Equation 

Our straight line equation, rounded to three decimal places, 
from Table 14 becomes 

H(text)g - .031X - .031 . (2). 



Where: H(text)^ « subjective textual information per text—English 
And: X « percentage of incorrect responses 



It is interesting to note, however, that the -.031 Intercept could well 
be zero for the population since Its 3S% confidence band spans that 
value (see Table 14); This zero value Is to be expected since the 
theoretical information content should be zero for zero Incorrect 
responses. In other words, If a person does not make a response error 
he Is predicting the te)it with certainty and, therefore, the text con- 
tains zero information for him. The English equation closely approxi- 
mates this. Also the' slope could (at the 95%* level of confidence) span 
from 0.028 to .03^5. 



The German Equation 

The equation derived by Weltner for the German language I's: 



H(text)Q - .039X - .080 (3) 



Where: H(text)g « subjective textual Information per text— German 
'And: X = percentage of Incorrect responses 



We have no measure of the confidence bands on the German data, 
but the agreement between the equations Is quite remarkable. See the 
graphs In Figure 2 (page 21. 

One additional point In regard to the residuals Is the extreme 
runs observed which might suggest a somewhat better fit with higher 
degree polynomials. This has not been pursued In this study. 
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It is .nter'esting to note that the equation can be put into a 
more useful form. The derivation is as follows: 



H(tcxt)E - .031X - .031 



(2) 



Since X » percentage of incorrect responses 

which can be represented by E/N • 100 



Whiere 

And 

Then 



E = number of wrongly guessed signs 
N « text length 

.1 H(text) .031 [E/N • 100] - .031 
H(text) - 3.1E/N j .031 
N*H(text) « 3. IE -1 .031N 



Setting N*H(text) « I, where I = information In bits we have 




With the equation in this form we can now directly evaluate the Infor- 
mation content of a text of length N for a specific subject. In view 
of our earlier evaluation of the confidence on the .03/1 coefficient 
which may well be zero, it may be concluded that our equation is inde- 
pendent of the text length N such that equation (A) at the 05% confi- 
dence level becomes 



I « 3. IE 



(5) 
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Cone 1 us Ion 

' y' ^ 



The application of Shannon's guessing procedure permits us to. 
measure the subjective information of a given text for a specific 
- learner. Unlike senseless texts, the subjective Information of a 
meaningful text can vary from learner to learner. The derivation of 
equation (2) equation (M and equation (5) in this study now allows 
?o?'.a"sJecTf!r?™.''' 'Wanlngfulness" of a specific English text 

ft 

^ l..rn!?""i'u?^^ tells us that the interactive events between 

lintlTJ r . environment have a direct Influence upon his learning. 
Since these Interactive events can now be specified In terms of Infor- 
J«tlon theory the specification of informa?lon-theoJ;.Jased.?r lleJta 
for the selection and completion of education goals Is possible. 

a u.l./l'^!^ equations now permit us to measure Information In terms of • 
a value that is dependent upon the Internal state of the learner. We 

niaJntna? ^'5 /K'^''^'° process information In a meaningful manned 
Uearning) We have also known that subject matter has an Inherent 
difflcu ty factor. The degree of this difficulty factor Jl us the 
internal state of the learner are direct variables ^harinHuence the 
effect veness of an Instructional system. The equations de ?Je" n 
this study, now provide a quant I tat I ve° measure of these factors. This 

i?oii:r'.'r'""%"\'°,'P"'^'^y attainable goals for both the in true- ' 
tlonal system and the learner. In practice, this can provide us with 
the means to specify precisely the lns.tructlons needed^^^rthe Earners. 

• To' the educator this means that he cannot only derive a auantl- 
taMve measure o Information contained In an Instruction sequence or 
oTj^tn'tr^uctl^n^^ '''' ^^^^"^^ -^^"^^ ef?ectlven«s 
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